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We survey work on the paradigm called "computing by observing." Its central feature is that one 
considers the behavior of an evolving system as the result of a computation. To this end an observer 
records this behavior It has turned out that the observed behavior of computationally simple systems 
can be very complex, when an appropriate observer is used. For example, a restricted version of 
context-free grammars with regular observers suffices to obtain computational completeness. As a 
second instantiation presented here, we apply an observer to sticker systems. Finally, some directions 
for further research are proposed. 

1 Introduction 

In the standard input-output paradigm in Computer Science, the result of a computation is obtained by 
introducing an input to some type of system, waiting while the system processes this input, and then 
reading the produced result. For example, a Turing Machine changes the contents of its tape according 
to its program until it reaches a final state; then a specified portion of the tape contains the output. The 
process that the system runs through and by which it produces the result is in general completely ignored. 

"Computing by observing" uses a different approach: the entire process of the system is actually 
the produced result. The final or initial state by themselves are irrelevant. This approach supposes the 
presence of an external observer that is able to watch the system during its functioning and to extract a 
formal recording of its behavior Thus in this paradigm, a computing device is constructed from two main 
components: the first one called the basic or underlying system and a second one, called the observer. 
The paradigm is inspired from the procedure followed in experimental sciences. There, it is a standard 
proceeding to conduct an experiment, observe the entire progress of a system, and then take the result 
of this observation as the final output. Consider for example the use of a catalyst in a chemical reaction. 
The reaction might produce the same result, but the energy used or the time required might be less. 
This difference only becomes clear when looking at a protocol of how the quantities in question evolved 
during the course of the experiment. 

The architecture that protocols the evolution of a system in the way described is schematically de- 
picted in Figure [T] 

The approach stresses the role of the observer in computation. This role can be crucial as we will see 
in Section |2| in fact, any computational device can be obtained by observing in the right manner a fixed 
(and simple) basic system (one can then talk of "computing by only observing..."). 

In what follows we will use some basic notions and standard terminology from formal languages 
theory (see, e.g., f?']). We only mention that 3i£9 and C9" denote the classes of regular and context-free 
grammars, respectively and that by REG, CP and RE we denote the classes of languages generated by 
regular, context-free, and type-0 grammars, respectively. 

*This work was done while Peter Leupold was funded as a post-doctoral fellow by the Japanese Society for the Promotion 
of Science under number P07810. 
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Figure 1: Basic structure of the "Computing by Observing" architecture. 



2 Observed Complexity of Simple Grammars 

In this section we present grammar/observer (G/0) systems that are generative devices based on the 
"computing by observing" paradigm. In this case, a formal grammar plays the role of the observed basic 
system and the external observer is a finite automaton. This section is essentially based on Q. 

2.1 The Observers: Monadic Transducers 

A grammar's states are the sentential forms of its derivations. So for the observer we need a device 
mapping these arbitrarily long strings into just one singular symbol. We use a special variant of finite 
automata with some feature known from Moore machines or also from subsequential transducers: the set 
of states is labeled with the symbols of an output alphabet. Any computation of the automaton produces 
as output the label of the state it halts in. Because we find it preferable that the observation of a certain 
string always leads to a fixed result, we consider here only deterministic and complete automata. The 
name monadic transducer is motivated from monadic string -rewriting rules; these can have arbitrarily 
long left sides that are rewritten to strings of length one or zero. 

For simplicity, in what follows, we present only the mappings that the observers define, without 
giving a real implementation (in terms of finite automata) for them. Therefore no more formal definition 
of monadic transducers is necessary here. The class of all monadic transducers is denoted by FAq- 

2.2 Grammar/Observer Systems 

A Grammar/Observer (G/0) system is a pair Q. = (G,A) constituted by a generative grammar G = 
{N, T,S,P) and a monadic transducer (observer) A with output alphabet £ U {_L}, which then is also the 
output alphabet of the entire system Q.. The transducer's input alphabet must be the union of N and T 
from the grammar so that it can read all sentential forms. 

Several modes of generation can be defined (see, e.g., fj]). Here we will consider the mode of 
generation that admits writing an empty and non-empty output in an arbitrary manner (free G/0 systems); 
i.e., the transducer can either output one letter or the empty word and freely alternate between these two 
options. A free G/0 system generates a language in the following manner: 



Lf(a) 



{A(wo,wi,... 



W„) \ S = Wo ^ Wl ^ ■ ■ ■ ^ Wn, Wn e T*}. 



Here A(wo,>vi , . . . , w„) is used as a more compact way of writing the catenation A(wo)A(wi) • • •A(w„). 
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In other words, the language contains all those words which the observer can produce during the 
possible terminating derivations of the underlying grammar. Derivations which do not terminate do not 
produce a valid output; this means that we only take into account finite words. Of course, by considering 
the other case of non-terminating derivations the G/0 systems could also be used to generate languages 
of infinite words. 

We also consider the variant where we define the language produced by D. as 

L_Lj(n) =L/(Q)ni*. 

In this way the strings in Lf{D.) containing _L are filtered out and they are not present in L± f{Q.). Thus 
the observer has in some sense the ability to reject a computation, when configurations of a certain class 
appear. The language generated by Q. is also called observed behavior of 



The functioning of a (free) G/0 system is sketched in Example 2.1 



G = (N, T, S, P) 
N = {S, A, B, C> T = {t> 

P= { A, 

A ^ AB, A ^ C, B ^ C, C ^ t 



A(w) = < 



'A 


if w = S 


a 


if w £ AB* 


b 


if w e C*B' 


c 


if we t*C' 


1 


else 



Example 2.1 



observed behavior of the grammar {a" b" | n >0} 



To each sentential form produced by the grammar G the observer A associates a symbol that can be 
a,Z?,c,_L or the empty string A (the vertical arrow is the observer mapping). Thus every computation 
of the observer produces one output symbol and the concatenation of these symbols is then the output 
string. For instance, in the figure the output string is Xa---ab---bc---c. The mapping defined by the 
observer is specified by the regular expressions. The language Lf{Q.) is obtained by considering all 
possible halting derivations of G and collecting all the output strings. It is easy to see that in this case 
L^j{a) is {a^b^c" \n>0]. 

In [7 1 it has been shown that slG/O system composed of very simple components, namely a locally 
commutative context-free grammar (LCCF), a proper subclass of context-free grammars, and a finite 
state automaton is computationally complete. 



Theorem 2.1 For each L G RE there exists a G/0 system Q. 

L^j{a)=L.mi 



(G,A), with G an LCCF, such that 



An interesting fact is that the observer's ability to produce _L, i.e., to eliminate certain computations, 
seems to be a powerful and essential feature in all the variants explored in [Tl . Besides the free variant 
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presented here, also systems that have to write in every single step and systems that have to write in every 
step after they have started to write are investigated there. 

As can be seen from the definition, for free systems we obtain all recursively enumerable languages 
over £ simply by intersection of a language over £U {_L} with the regular language £*. Now notice that 
recursive languages are closed under intersection with regular sets. Therefore, there must exist some 
grammar/observer systems Q. generating a non-recursive Lf{Q.) (i.e., not using the filtering with _L). 
However, this intersection filters out a great many words produced from undesired computations; so 
despite being simple, this intersection distinguishes between good and bad computations. Therefore it 
seems very unlikely that the same model without this feature will be computationally complete, although 
it generates non-recursive languages. 

As discussed in the introduction, the idea of the presented framework is to stress the role of the 
observer in computations. Therefore, it is interesting to understand how much one can compute by 
making only changes in the observer, keeping the observed basic system unchanged. We show by means 
of an example that a G/0 system can generate very different languages if the observer is changed while 
the grammar remains fixed. 

Let us consider the context-free grammar G = {{S ,A, B ,C} ,{t , p} , S,{S —>■ pS,S —^p,S—^ A, A —>■ 
AS, A C,B C,C t}). If G is coupled with the observer A' such that 
A'(w) = aif w G {S,A,B,C,t,p}+, then Q. = (G,A') defines the language Lj_j{D.) = {a' \ i > 2}, a 

regular language. In fact, the derivation S ^ pS p"^^S p" produces (when observed) the string 

Keeping the same grammar G we change the observer into A" such that: 

^ X ifw = S, 
a if w GAS*, 
b ifwGC+5*, 
c ifwGf+C*, 
_L else 

In this case, one can verifies that Q. = {G,A") generates the language L±j{Q.) = {a"b"c" \ n > 0}, a 
context-sensitive language. 

This example suffices to underline that part of the computation can be done by choosing the right 
observer, keeping unchanged the underlying basic system. Actually, this part can be really crucial: 
As shown in |0| one can construct an universal context-free grammar that can generate all recursively 
enumerable languages when observed in the correct manner. 

We can also consider several restrictions on the observed system. In particular, we can bound the 
number of nonterminals in the sentential forms. 

In this respect, we notice that the universal context-free grammar used in [6] has no bound on the 
number of nonterminals in its sentential forms. 

The next result shows that indeed this is a necessary property of context-free grammars that are 
observationally complete for type-0 grammars. In fact, when a bound is imposed, the observed behaviors 
are regular. Recall that a context-free grammar is nonterminal bounded if there exists a constant k such 
that all sentential forms generated by the grammar have at most k nonterminals. Clearly, a regular 
grammar is nonterminal bounded. 

Theorem 2.2 For every G/0 system Q. = (G,A), with G nonterminal bounded context-free, Lj_ f{Q.) is 
regular ^ 



A"{w) = < 
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3 Observed Complexity of Simple BioSystems 



As we have seen in Section 2.2 the complexity of the produced output is influenced by the particular dy- 
namics of the observed system. This means that the entire trajectory followed by the observed system is 
important rather than the momentary reached states. In this section we stress the distinction between the 
complexity of the result and of the behavior by considering as observed system a sticker system (a com- 
putational model inspired by the self-assembly of DNA strands). It is known that sticker systems have a 
computational power that is less than that of regular grammars. However their behavioral complexity is 
computationally complete when observed in an appropriate manner. This means that their observed be- 
haviors can represent all possible computable languages. In other words, a basic system (sticker system) 
that is less powerful than a regular grammar, is then universal in terms of observed behaviors {this result 



clearly contrasts with Theorem 2.2 ■ observed behaviors of regular grammars are still regular). 



Sticker systems were introduced in [31 as a formal model of the operation of annealing (and ligation) 
operation that is largely used in DNA computing area. The basic operation of a sticker system is the 
sticking operation that constructs double stranded sequences out of "DNA dominoes" (polyominoes) that 
are sequences with one or two sticky ends, or single stranded sequences, attaching to each other by 
ligation and annealing. 

An observable sticker system was then introduced in 0. The idea of an observable sticker system 
can be expressed in the following way: an observer (for example, a microscope) is placed outside the 
"test tube", where (an unbounded number of copies of) DNA strands and DNA dominoes are placed 
together. Some of these molecules are marked (for example, with a fluorescent particle). The molecules 
in the solution will start to self-assemble (to stick to each other) and, in this way, new molecules are 
obtained. The observer watches the trajectory of the marked molecules and stores such evolution on an 
external tape in a chronological order. For each possible trajectory of the marked molecules a certain 
string is obtained. Collecting all the possible trajectories of such marked strands we obtain a language. 

We first recall the definition of sticker systems and we couple them with an external observer, defining 
an observable sticker system. This part is essentially based on [2J. 

Consider a symmetric relation p C V x V over V (of complementarity). Following liH, we associate 
with V the monoid V* x V* of pairs of strings. Because it is intended to represent DNA molecules, 

) and V* X as f ^* 



we also write elements {x\,X2) G x V* in the form 



We denote by 



V 
V 



{ 



a,b (^V, {a,b) G p} the set of complete double symbols, and WKp{V) 



V 
V 



the set of the complete double-stranded sequences (complete molecules) also written as 
xi is the upper strand and X2 is the lower strand. 



X2 



IS 

p 

where 



Figure 2: Sticking operation 
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As in ||5l, we use single strands - the elements of S{V) ~ y*^^ \ ^ molecules with 

S{V), from now on called 



V 
V 



(a possible) overhang on the right, which are the elements of Rp (V) 

L ' Jp 

well-Started molecules (upper and lower strand are defined as in the case of complete molecules). 

Given a well started molecule u £ Rp{V) and a single strand v G S{V), we recall in Figure |2|the 
partial operation /i : Rp{V) x S{V) — > ^p(^) of sticking, as defined in f5\. We point out that we use a 
case of sticking, restricted to pasting a single strand to the right side of a well-started molecule (with a 
possible overhang on the right), corresponding to the simple regular sticker systems. 

A (simple regular) sticker system is a construct 7 = {V,p,X,D), where X Rp{V) is the (finite) set 
of axioms, and D C S{V) is the (finite) set of dominoes (in this case these are single strands). Given 
M, w G Rp{V), we write u^wiffw = p{u,v) for some v G D. A sequence {wi)i<i<k Q Rp{V) is called a 
complete computation if w\ G X, Wi =^ w,+i for 1 < / < ^ and G WKp{V). 

The language generated by a sticker system 7 is the set of upper strands of all complete molecules 
derived from the axioms. It is known that the family of languages generated by simple regular sticker 
systems is strictly included in the family of regular languages (see [5] for the proof). 

For an alphabet V, our double-symbol alphabet constructed over V is 



V 
V 



u . u' 



p 



We define an observer A G FAq, with input alphabet Vd, that reads an entire molecule (element of RpiV)) 
and outputs one symbol from the output alphabet ZU {_L} (every well-started molecule in Rp{V) C is 
read, in a classical way, from left to right, scanning one double symbol from Vd at a time). For a molecule 
w ^Rpiy) and the observer A we write A{w) to indicate such output; for a sequence wi , . . . , w„ of « > 1 
of molecules in RpiV) we write A(wi, . . . ,Wn) for the string A{w\) • • ■A{w„). 

An observable sticker system with output alphabet ZU {_L} is a construct = (7, A), where 7 is the 
sticker system with alphabet V , and A G FAq is the observer with input alphabet Vd and output alphabet 

su{±}. 

We denote the collection of all complete computations of by 6(0). The language, over the output 
alphabet ZU {_L}, generated by an observable sticker system 0, is defined as = {A{s) \ s G 6(0)}. 
As done for G/0 systems, we want to filter out the words that contain the special symbol _L, then we 
consider the language L±{(})) = L{<p) nZ*. 

We will illustrate with a simple example how an observable sticker system works. At the same time 
this example shows how one can construct an observable sticker system with an observed behavior that is 
a non regular language despite the fact that the power of simple regular sticker systems, when considered 
in the classical way, is subregular. 

Consider the following observable sticker system (p = (7, A): 



7 = {V = {a,c,g,t},p = {{a,t), (eg), {t, a), {g,c)},X = { 



D 



X 



X J '\ t J \ X J '\ g 



X 



with the observer A defined by the following mapping: 



28 



Complexity through the Observation of Simple Systems 



A(w) 



d, 
A, 



if w G 



if w £ 



a 
t 
a 
t 



a 
X 

X 



u 



u 



otherwise. 

The language generated by / is Li = {b'"d'^ \m>n,m>l,n>0}^ REG. 
Below is an example for a computation of ^ (generating bbbbdd): 



Step 

















Added 




{I) 


{I) 


(t) 


{I) 


(t) 


(i) 


Molecule 


a 


aa 


aaa 


aaa 


aaac 


aaac 




aaac 




t 


t 


t 


tt 


tt 


ttt 




tttg 


Output 


b 


b 


b 


b 


d 


d 


X 



current molecule either 



the observer maps the result (a molecule without markers) to 



The idea of the system <j) is the following: think of symbols c, g as "markers". While we stick to the 

b. As soon as we attach to the current molecule a marker, the observer maps the resulting molecule to d, 
until the strand with a marker is extended or until the molecule is completed. 

Suppose that, when the first marker is attached, the length of the strand with that marker is h, the 
length of the other strand is I2 (clearly, li > I2), and then the output produced so far is b'^^^^^-^d. To 
complete the molecule by extending the strand without the marker, we need to attach h — 12 symbols 
to it, and in this case the observer outputs t/''^'^-!^ Thus, the resulting string x consists of I1 + I2 — 2 
b's and h — 12 d's. Since /i > 1> the difference between the number of b's and the number of d's is 
/i + /2 — 2 — (/i — 12) = 2/2 — 2 > 0. (Recall that in case we attach a symbol to a string with the marker, 
the observer only outputs X, so the inequality m = \x\h > \x\cj = n remains valid, and all the combinations 
{m,n), m>n are possible). Hence, L{y) = L\. 

To get computational completeness we do not need "complicated" sticker systems but simple regular 
sticker systems and an observer that is able to discard any "bad" evolution. 

Theorem 3.1 For each L E RE there exists an observable sticker system such that Lj_{^) = L. 



Notice that, as already remarked earlier, using Theorem 3. 1 the definition of and the fact that 

recursive languages are closed under intersection with regular languages, we obtain: 
Corollary 3.1 There exists an observable sticker system (j) such that L{<^) is a non-recursive language. 

m 

An interesting question here is why we obtain computational completeness with a regular system 
while with regular grammars the architecture is much weaker. Intuitively the reason for this is the fact 
that a grammar can only write a terminal once, while the sticker system can do so twice - once in the 
upper strand, once in the lower. Thus the workspace is rewritable in a limited manner while for a regular 
grammar the workspace is essentially only its one non-terminal. 



4 Research Perspectives 

As the surveyed results show, the paradigm of evolution and observation is indeed quite universally 
applicable to discrete systems known in Theoretical Computer Science. As soon as a simple way of 
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mapping its configurations to single letters is found, any such system can be the base of a computation 
by observation. However, not every type of system is equally adapt for achieving great computational 
power. Actually, Theorem 2.2 shows that even a decrease in power can occur. Here the crucial factor 
seems to be space that can repetitively be rewritten; if it is finite, only finite-state computations seem 
possible. 

Besides investigating further specific systems with respect to their power in an evolution-observation- 
system, it seems very interesting to try and formalize the thoughts of the preceding paragraph and to try 
to identify other such crucial factors. This could eventually lead to a type of measure for the capacity of 
systems to process information. On the other hand, limiting these crucial resources would lead to com- 
plexity hierarchies. A first result in this direction is that a linear space bound on a grammar's sentential 
form in an otherwise computationally complete model characterizes the context-sensitive languages Q. 

In this paper we have presented only generative devices. However, the paradigm can be applied also 
to construct accepting devices. In this case, one can imagine introducing an input to the basic system, 
and observe its evolution. If the evolution of the observed system is the one expected (for example 
follows a certain pattern) then the input is accepted by the system, otherwise is considered to be rejected. 
This approach has been applied to string rewriting where a characterization of context-sensitive 
languages is provided) and DNA splicing (191). 

Another possibility is to use the paradigm to formalize of the notion of "abstraction". In fact, the 
process of observation collapses the state-space of the basic system into a smaller one, doing a sort of 
abstraction on the observed system (keeping some specific information, while ignoring others). The 
observed behaviors are then "upper approximations" of the behaviors of the basic system. Two interest- 
ing problems should be investigated: (i) how to guarantee that the observed behaviors maintains some 
specific properties of the basic system (e.g., if the basic system oscillates then this is maintained in the 
observed behaviors); (ii) given the basic system and the observer, how to provide, in an algorithmically 
efficient manner, a finite description (e.g., a grammar) of the sets of observed behaviors. 

One more application of the paradigm could be in the area of quantum computation. There the 
concept of observer is crucial, and actually, a major problem in quantum experiments is to design the 
correct observer. However no notion of complexity of the external observer has been formally defined. In 
the proposed paradigm one could formalize this notion by explicitly dividing the observed system from 
the observer (111). 
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